A Rotated Array Clustered Extended Hypercube Processor, the RACE-H Processor
نویسندگان
چکیده
The RACE-Hypercube Processor is a highly parallel signal processor with fourteen degrees of freedom in selecting parallel operations. The fourteen degrees of freedom include the ability to select the 1) number of very long instruction word (VLIW) slots, 2) number and type of application specific instructions, 3) number and type of application specific processing element (PE) hardware assists, 4) number of PEs, 5) operation as a single issue uni-processor, 6) operation as a variable-length indirect VLIW (iVLIW) uni-processor, 7) operating each PE as a single issue PE, 8) operating each PE as a variable-length iVLIW PE, 9) operation with 32-bit packed data, 10) operation with 64-bit packed data, 11) operation with parallel independent PE conditional execution, 12) type of PE-to-PE communications including single cycle concurrent mesh, torus, hypercube, hypercube-complement communications, 13) operation with independent PE threaded array operations, and 14) background operations on the scalable direct memory access (DMA). A brief description of the RACE-H architecture is provided along with a description of the scalable DMA subsystem, programming tools, and a brief performance evaluation. The RACE-Hypercube architecture allows the achievement of up to 1.024 trillion bytes/sec at a relatively low clock frequency of 250MHz with short execution unit pipelines and an architecture that is programmer friendly.
منابع مشابه
Clustered Extended Hypercube Processor , the RACE - HTM Processor
The RACE-Hypercube Processor is a highly parallel signal processor with fourteen degrees of freedom in selecting parallel operations. The fourteen degrees of freedom include the ability to select the 1) number of VLIW slots, 2) number and type of application specific instructions, 3) number and type of application specific PE hardware assists, 4) number of PEs, 5) operation as a single issue un...
متن کاملDesign and Implementation of Field Programmable Gate Array Based Baseband Processor for Passive Radio Frequency Identification Tag (TECHNICAL NOTE)
In this paper, an Ultra High Frequency (UHF) base band processor for a passive tag is presented. It proposes a Radio Frequency Identification (RFID) tag digital base band architecture which is compatible with the EPC C C2/ISO18000-6B protocol. Several design approaches such as clock gating technique, clock strobe design and clock management are used. In order to reduce the area Decimal Matrix C...
متن کاملHierarchical Gate-Array Routing on a Hypercube Multiprocessor
Gate-arrays are the most common design style for semicus-tom VLSI integrated circuits. An important part of the gate-array design process is the routing of wires between the logic elements, which is an extremely compute-intensive operation. This paper presents an algorithm for routing gate-arrays that uses a hypercube connected parallel processor to provide the necessary computation power. In o...
متن کاملProcessor Array Requirements for Advanced Image Processing: Theory and Experiment
Two topics are described. The first is the use of connection or reachability matrices as pictorial aids in the investigation of fixed-geometry processor arrays. Matrices are shown for mesh and hypercube arrays and also for a connection topology which is based on hashing. The second topic is an explanation why Hopfield-style associative memories exhibit large variations in the recall strength of...
متن کاملUltra-Low-Energy DSP Processor Design for Many-Core Parallel Applications
Background and Objectives: Digital signal processors are widely used in energy constrained applications in which battery lifetime is a critical concern. Accordingly, designing ultra-low-energy processors is a major concern. In this work and in the first step, we propose a sub-threshold DSP processor. Methods: As our baseline architecture, we use a modified version of an existing ultra-low-power...
متن کامل